Perceptual Coding of Audio Signals Using Adaptive Time-Frequency Transform
Authors
Abstract
Wideband digital audio signals have very high data rates owing to their complex nature and the demand for high-quality reproduction. Although recent technological advances have significantly reduced the cost of bandwidth and miniaturized storage, the rapid growth in the volume of digital audio content continues to drive the need for better compression algorithms. Over the years, various perceptually lossless compression techniques have been introduced, and transform-based techniques have made a significant impact in recent years. In this paper, we propose one such transform-based compression technique, in which the joint time-frequency (TF) properties of nonstationary audio signals are exploited to create a compact energy representation of the signal in fewer coefficients. The decomposition coefficients are processed and perceptually filtered to retain only the relevant coefficients. Perceptual filtering (psychoacoustics) is applied in a novel way by analyzing and performing TF-specific psychoacoustics experiments. An added advantage of the proposed technique is that, because it is signal-adaptive, it does not need predetermined segmentation of audio signals for processing. Eight stereo audio samples of different varieties were used in the study. Subjective listening tests (mean opinion score, MOS) were performed, and the subjective difference grades (SDG) were used to compare the performance of the proposed coder with MP3, AAC, and HE-AAC encoders. The proposed technique achieved compression ratios in the range of 8 to 40, with SDG ranging from -0.53 to -2.27.
Related resources
An adaptive wavelet-based approach for perceptual low bit rate audio coding attending to entropy-type criteria
This paper outlines an adaptive wavelet-based perceptual audio coding scheme attending to various entropy-type criteria. Its performance using some different wavelet families and various filter lengths and decomposition depths has also been investigated. An optimal choice of these parameters is accomplished in order to evaluate both quality and bit rate of compressed signals for four different ...
Perceptual Coding of Audio Using Signal-Adaptive Filterbanks
This thesis studies the application of signal-adaptive filter banks in perceptual coding of audio with an emphasis on Wavelet Filter Banks (WFB). It provides an overview of perceptual coding of audio, the motivating psychoacoustic principles, transforms and wavelet theory. Additionally, different existing wavelet-based audio coding schemes are presented. The aim of most of the schemes is to overc...
Tree and filter optimization for audio compression in a wavelet-based perceptual audio coder
This paper outlines a new perceptual low bit rate audio coding scheme based on adapted wavelet representations. It claims wavelet tree and filter adaptation attending to a perceptual entropy-based method. To achieve such adaptive structure, a periodized wavelet packet transform is performed for each audio frame. After the transform, the encoder employs scalar adaptive quantization, controlled b...
An Adaptive Segmentation Method Using Fractal Dimension and Wavelet Transform
In analyzing a signal, especially a non-stationary signal, it is often necessary to segment the signal into small epochs. Segmentation can be performed by splitting the signal at time instants where the signal amplitude or frequency changes. In this paper, the signal is initially decomposed into signals with different frequency bands using wavelet transform. Then, fractal dimension of ...
Journal:
- EURASIP J. Audio, Speech and Music Processing
Volume 2007
Publication date: 2007